Approximately Independent Features of Languages
Abstract
To facilitate the testing of models for the evolution of languages, the present note offers a set of linguistic features that are approximately independent of each other. To find these features, the adjusted Rand index (R) is used to estimate the degree of pairwise relationship among 130 linguistic features in a large published database. Many of the R values prove to be near 0, as predicted for independent features, and a subset of 47 features is found with an average R of -0.0001. These 47 features are recommended for use in statistical tests that require independent units of analysis.
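The pairwise comparison described in the abstract can be illustrated with a minimal sketch, which is not the author's code. It computes the adjusted Rand index between two categorical features, each given as a list with one value per language, and averages it over all feature pairs. The function names, the toy feature names, and the toy values are hypothetical, and the sketch assumes every feature is coded for every language (no missing values).

```python
from collections import Counter
from itertools import combinations
from math import comb


def adjusted_rand_index(x, y):
    """Adjusted Rand index between two labelings of the same items.

    x and y are equal-length sequences; position i holds the value that
    language i takes for each of the two features.
    """
    if len(x) != len(y):
        raise ValueError("both features must cover the same languages")
    total_pairs = comb(len(x), 2)
    if total_pairs == 0:
        return 1.0  # fewer than two items: treated as trivial agreement

    # Pairs of languages that agree on both features, on x only, on y only.
    both = sum(comb(c, 2) for c in Counter(zip(x, y)).values())
    same_x = sum(comb(c, 2) for c in Counter(x).values())
    same_y = sum(comb(c, 2) for c in Counter(y).values())

    expected = same_x * same_y / total_pairs  # agreement expected by chance
    maximum = (same_x + same_y) / 2
    if maximum == expected:
        return 1.0  # degenerate case: both labelings are trivial
    return (both - expected) / (maximum - expected)


def mean_pairwise_ari(features):
    """Average adjusted Rand index over all pairs of features.

    features maps a feature name to its list of per-language values.
    """
    pairs = list(combinations(features, 2))
    if not pairs:
        raise ValueError("need at least two features")
    scores = [adjusted_rand_index(features[a], features[b]) for a, b in pairs]
    return sum(scores) / len(scores)


# Hypothetical toy data: three features coded for four languages.
toy_features = {
    "word_order": ["SOV", "SOV", "SVO", "SVO"],
    "has_tone": ["yes", "yes", "no", "no"],
    "has_articles": ["yes", "no", "yes", "no"],
}
toy_mean = mean_pairwise_ari(toy_features)
```

A value near 0 for a pair indicates agreement no better than chance, which is the behavior the abstract describes for independent features; a value of 1 indicates that the two features partition the languages identically. How the 47-feature subset was actually selected is not stated in the abstract, so the averaging step here only shows how an average R over a candidate subset could be computed.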
Similar resources
Exploring Novice Raters’ Textual Considerations in Independent and Negotiated Ratings
Educators often employ various training techniques to reduce raters’ subjectivity. Negotiation is a technique which can assist novice raters to co-construct a shared understanding of the writing assessment when rating collaboratively. There is little research, however, on rating behaviors of novice raters while employing negotiation techniques and the effect of negotiation on their understandin...
Significance of histopathological features of breast carcinoma and its correlation for decision of future therapy
Breast cancer is one of the most common malignancies among women and is considered the leading cause of mortality in females suffering from malignant processes. Axillary lymph node metastasis (ALNM) is the most important predictor of survival in patients with breast carcinoma. The purpose of this study was to determine the association between the incidence of ALNM and morphologic criteria by univaria...
Effect of Holistic vs. Analytic Assessment on Improving Iranian Intermediate EFL Learners’ Writing Skill
In assessing foreign language writing, holistic and analytic scoring can be used to measure a variety of discourse and linguistic features. This study aimed to investigate the possible significant effect of analytic and holistic assessments on improving writing skill among Iranian EFL learners. For this purpose, two groups of intermediate EFL learners, after being homogenized, were divided in...
Lexical Semantics and Selection of TAM in Bantu Languages: A Case of Semantic Classification of Kiswahili Verbs
The existing literature on Bantu verbal semantics demonstrated that inherent semantic content of verbs pairs directly with the selection of tense, aspect and modality formatives in Bantu languages like Chasu, Lucazi, Lusamia, and Shiyeyi. Thus, the gist of this paper is the articulation of semantic classification of verbs in Kiswahili based on the selection of TAM types. This is because the sem...
Language-independent Gender Prediction on Twitter
In this paper we present a set of experiments and analyses on predicting the gender of Twitter users based on language-independent features extracted either from the text or the metadata of users’ tweets. We perform our experiments on the TwiSty dataset containing manual gender annotations for users speaking six different languages. Our classification results show that, while the prediction mode...